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ABSTRACT 


A model for the auxiliary memory function of a 
segmented, multi-processor, time-shared computer 
system is set up. A drum system in particular is 
discusse@, elthough no loss of generality is implied 
by limiting the discussion to drums. Particular attention 
is given to the queue of requests waiting for drum 
use, It is shown that a shortest access time first 
queue discipline is the most efficient, with the 
access time being defined as the time required for 
the drum to be positioned, and is measured from the 
finish of service of the last request to the beginning 
of the data transfer for the present request. A 
detailed study of the shortest access time queue is 
made, giving the minimum access time probability 
distribution, equations for the number in queue, 
and equations for the wait in the queve, Simulations 
were used to verify these equations; the results 
are discussed, Finally, a general Markov Model for 
Queues is discussed in an Appendix, 


Thesis Supervisor: J. B. Dennis 
Title: Associate Professor of Electrical Engineering 


SOTERA iy ool ST RTM ar eaten a LEST yi ce ER RR AY See er oe YT RB chee tee ge 


ACKNOWLEDGEMENT 


I wish to express my appreciation to Project 
MAC for the use of its computational facilities, without 
which this paper would not have evolved; to Professor 
J.B. Dennis, my thesis advisor, for the many hours 
spent in discussing the ideas contained here; to 
A. L, Scherr for the use of his simulation language 
and for the many useful ideas that he gave; to 
Professor A, W, Drake, Professor D, C. Carroll, 
A. A. Smith, and E, Van Horn for constructive and 
enlightening criticisms; and to my wife Anne for her 


patience and encouragement, 


TABLE OF CONTENTS 


CHAPTER 1, INTRODUCTION. ........00-ccccccceccccceseece 6 
CHAPTER 2. BACKGROUND, .....cccccc cance cccccacccesscees 9 
CHAPTER 3, “THE: DRUM SYSTEMS. <.csiecc- secon ses veneesevecene 1? 
Fol. Introduction. ...ccccccccncecsccccccccseresssve LP 
3.2. The Users’ Processes Model...cccccerecceccvecs 18 
Fed. The Queue. .cccccccccccccvccvevcccssccccssesevse 29 
3.4%. The Drums...... ccc cee seccccccceveccscvevesces 40 
CHAPTER 4&, THE SHORTEST ACCESS TIME QUEUE.........-002 46 
Bide) INbYOGUsthOnisu sy « nisiees bee sns aes che besa aalweanee, 0 
4,2, The Minimum Access Time Distribution.......20. 47 
4,3. Examination of the Shortest Access Queue...... 56 
CHAPTER 5, THE SIMULATION RESULTS. CONOLUSIONS....... 65 
5.1, Introduction... ......c ccc cccc rec cccsccccecccces OF 
5.2. The Drum Simulation. .......ccceeccccscccccecee 66 
5.3. Shortest Access Time Queue Simulation.....cce. 74 
5.4, Conclusions sss scacisciac cceacvediedeceesseesaeea 76 
5.5. Suggestions for Future Study...ccsecscccccesce 77 
APPENDIX 1, THE POLLACZEK-KHINTCHINE FORMULA. ....ccee02 79 
APPENDIX 2, WAITING TIME IN A SHORTEST ACCESS QUEUE.... 87 
APPENDIX 3, DESCRIPTION OF A MIXED-POLICY QUEUE........ 91 
APPENDIX 4, A CONTINUOCUS-TIME MARKOV MODEL,......-ese0e 99 
BIBLIOGRAPRY 3's occie cvs site hue Na par eee tenes iteceee as kOS 


Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 
Figure 


Figure 
Figure 
Figure 
Figure 


LIST OF ILLUSEPRATIONS 


2.1. One user in a segmented system.....cseeeecee 
3.1. The overall System Model........e ec ecescceee 
3.2, Read and Write requestsS.....c-ccesesccccsene 
3.3. Number of Segments per request... ..csesescae 
3.4%, Number of pages per Segment....ccrcccssccces 
3.5. Inter-request times........ ccc eee cc rencenee 
3.6. Requested drum positions.........sseceevevee 
3.7. Requested Grum..... cc ear car cccercecvccvcves 
3.8. The Processes Model.......cccecesvocecceccce 
3.9. A structure for the queue.s.cccccccsccscccer 
3.10. The Queue Model... cc ccceccrcccerevesvevcves 
3.11. The Organization of the Drum.......-.-scece 
3.12. The Delete Mechanism....csrsccscccccencvces 
3.13. The Drums Model... ccccccsccccccccrcereasvces 
4.2.1. Operation of the Shortest Access Queue.... 
4.2.2, Access time for an TEQUESE..cccaccescrece 
4,2,3. Shortest Access time Distribution.......0. 
4.2.4, Mean AccesS time......cceccecceesercsrenes 
4.2.5. Standard deviation of access time.....cccee 
5.1. Waiting times in queu€......ccsecccnncescces 
5.2, Number in queue... secs secs cnccrcuccrccvee 
5.3. Number of fields used per sector...e.ceosees 


A3.1. A structure of a mixed policy queve.....ce6 
A3.2. Position of tageed request... .ceccscccrcccses 
A3.3. Window movement.....ccecscresccvcsvcccccece 


A3.4. Comparison of window movement and 
tagged request posSition...ccrcecsscccces 


CHAPTER I. INTRODUCTION. 

: With the advent of more and more complex computing 
systems it has become increasingly important to have some 
reliable means for evaluating the performance of the system. 

In the Compatible Time-Shared System (CTSS) at Project MAC (2), 
M.I.T., for example, the scheduling of users is a problem 

that is receiving much attention, Patel (14) has considered 
first-come-first-served allocation of processor resources 

to users, and a multiple-level dynamic priority scheduling 
algorithm which closely models the scheduling algorithm used 

in CTSS (2), Heller (10), on the other hand, has considered 
the more general problem of a multiple-processor time-shared 
system. The purpose of the scheduling algorithm is to allocate 
the processor resources as efficiently and equitably as 
possible, minimizing processor idle time and user waiting 

time, Various schemes for scheduling have been tested at 

MAC but the one described by Patel has proved most satisfactory. 
Scherr (17) has made a far-reaching study of CTSS-like systems, 
with particular emphasis on their Markovian aspects, 

Before the user's waiting time can be minimized it is 
necessary to minimize the processor idle time. One of the most 
inefficient operations is the swapping of information between 
the core memory 2nd the drum or dise files, Oftentimes the 
processor must stand idle during a swap, awaiting the arrival 
in core of a block of data, One way to ease this diffioulty 


is to use 


one or more processors and let several programs occupy core 
at once. Then during the time that the swapping for one 
program is taking place, the processors can be kept busy on 
other probrams, In this way overall processor idle time can 
be reduced. These ideas of multiprogramming and multiple 
processors are not new; it is only recently that computer 
hardware has become sufficiently sophisticated to handle the 
task effectively. 

Additional alleviation of the swapping problem can be 
effected by making drum and dise file operation as effieient 
as possible. In single-program systems efficiency of drum 
operation is not a problem since only one program (the program) 
can demand use of the drum at a time, Clearly, in a mlti- 
programmed system several programs can make simultaneous 
demands on the drum and disc facilities, making special 
organization a must to minimize the waiting time of a given 
program for its reauest to be serviced, and at the same time 
minimizing idle time of the entire system, It is clear 
that in a poorly organized drum system the inefficiency of 
the drum system can seriously impair the operation of the rest 
of the computing system because continued operation often 
depends on the reading of informtinn into core: a program 
cannot begin to operate a segment until that segment has been 
placed in core. For instance, suppose we had at our disposal 
the means to reduce the average service time of a drum request 


by two or three milliseconds. In the two or three milliseconds 


eens a 


saved much computation can be performed, 

In this paper we consider a model for a drum file 
memory system, and in particular a model for the programs 
in such a system. The model will describe the manner in 
which a program (or more properly, a process) makes requests 
for file memory use. A computer simulation has been written 
for the particular model described. In Chapter 4 a pertinent 
mathematioal model is given. In Chapter 5 the results obtained 
from thie model are compared with the results obtained from 
the simulation, The interested reader is referred to 
Scherr (17) and to Appendix 4 for an outline of the complexity 
of even the most tractable of models, the Markov Model. 


CHAPTER 2, BACKGROUND. 

It is the purpose of this section to discuss some 
of the concepts upon which this paper is based, One of the 
problems of existing time-shared systems is that the processor 
must stand idle while the present and next user's programs 
are being swapped in or out of core, One proposed solution 
to the problem is to run one user's program in core, meanwhile 
swapping the next user into a remaining part of core. Then 
the processor would be switched to the next user, and the 
swapping operation would bezin anew, Of course each user 
would not be arbitrarily assigned half of core, but programs 
would be matched in some cpmplementary manner long ones with 
short ones. This mode of overlapped operation in a time-shared 
system ts sometimes referred to as a ring (cf. Scherr (17)). 
Again, idle processor problems arise if one program should 
require all available core space. Then no simultaneous 
swapping could take place, 

A generalization of the above solution to the problem 
has been considered at length by J.B. Dennis and E Van Horn (3,6). 
It 1s known as segmentation. Under this scheme a user's 
program would be divided into a set of individually named 
parts, called segments, The user is assumed to have segmented 


his program in the way which seems most appropriate to him. 
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Segments may be classified roughly according to the 
Manner in which they may be accessed: 

(1) Read-Only... 

(2) Data, 

(3) Pure procedure, 
Some combinations of these classes are permitted, 

A pure procedure segment is a set of instructions 
which directs a process* to operate on data but not on itself. 


Thus we could ask the compiler to extract all the symbols, 
variables and so forth, from a program and group them into 

one segment; procedure segments would then be allowed to 
modify and use this data. Of course certain programs, notably 
short ones, would be contained entirely in one segment. 


Read-Only data might be input data, format specifications, ' 


‘and so forth, which are not altered by the processes in a 


user's computation, Operation of a program might be in the 
following manner, Some first segment would be brought into 
core, together with all necessary data segments, which may or 
may not include read-only data, Then segments may act 

singly or in groups (if several processors are available) 

on the data, New segments are brought in as needed (when 

a reference is made to a segment not already in core). The 
programmer may wish to declare subroutine segments, 


which might contain some of 


* 
A process is carried out by a processor under the direction 
of instructions in procedure segments, (3,4.7). 
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his often-used subroutines, and which for efficiency’ sake 
should be kept handy in core at all times. Of course certain 
subroutines, such as printing or exponentiation subroutines, 
might be kept in special common, or library segments, being 
available for the common use of all users. In this way each 
user would not need to be given his own copy of each and 
every library routine, Figure 2.1 suggests the operation 

of the system, showing a time sequence of groups of segments 
operating on data, The time sequence may not be in the order 
in which the segments were written, and the same segment may 
appear many times in the sequence. Several processors might 
be available to work for one user, so that several segments 
might be active at once. Note that we have indicated that 
the segments are in general of various lengths. Note too 
that read-only data may not need to be present in core (main 
memory ) but may be referenced from, say, a drum memory 
(auxiliary memory) as needed, 

Clearly, by writing programs in segments, only a few 
segments of a given program need be in core at once, the rest 
being stored in auxiliary memory, perhaps on a high speed 
drum. A segment in core which is being used by one or more 
processes is called a working, or active segment. Segments 
kept on the drum are called dormant segments. Many users, 
or course, can have segments working simultaneously if there 
is more than one processor available, When a segment is 
working it can have one or more processes taking place in it, 


depending again on how many processors are available to work 
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Figure 2,1. Operation of one user in a segmented systen, 
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on it, Hence when talking of the computations within a 
user's set of segments, we shall speck of a user’s processes 


rather than a user's program, 

Fach segment will be named in some arbitrary manner, 
When a process makes reference to a segment (by naming it 
and giving the address of some word within it) which is 


not in core, that process is temporarily suspended until 

the required segment is brought into core. Since many users 
may have simultaneous press sues it will be necessary to have 
some central control over allocation, swapping, and so forth. 
The program whioh does this job is called the Supervisor 
program. When a process references a segment other than the 
one in which it. is taking place, the Supervisor will transfer 
control to that segment if it is in core. Otherwise that 
process must halt until the Supervisor has brought in the 
needed segment, With many processes running there will be 

a@ great demand for drum usage. We think of a process causing 
a réquest to be made to the drums for information, rather than 
the process itself making the request, We can see that 
references to other segments are at arbitrary points in time, 
and may be to arbitrary segments, whioh may have arbitrary 
length. 
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If requests should be generated momentarily faster 
than they can be serviced, then the waiting requests must 
be places in a waiting line, or queue. The order in which 
requests are serviced (i.e., the order in which they leave 
the queue) is not necessarily the order in which they arrived 
at the queue. We can see three distinct parts of the data 
transmission function of the computing system: the users’ 
processes, which generate requests (either to read or to 
write on the drum); a queue into which requests that have 
to wait are placed, and which has a selection rule for next 
out, called the queve discipline; and finally the drums. 

One final word must be said, concerning the trans- 
mission of data to and from the drums, It seems both desireable 
and convenient to have some standard unit of transmission 
and allocation, which we call the page. It is always 
possible to store pages consecutively on the drum (see Section 
3.4). This requires that there exist some mechanism for 
deleting unnecessary data from the drums, One possible 
mechanism, using a percentage level of drum occupanoy, 1s 
discussed in Section 3.4. It is necessary for the Supervisor 


to maintain some level of drum occupancy, and to have a 
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deletion policy in order to keep the drum from overflowinc. 

We will see under our study of queues in Seotion 3.3 
and Chapter 4 that for each request there is a certain drum 
positioning time, or access time, that must pass while the 
requested starting page comes opposite the drum's read-write 
heads, This access time is wasted time. We seek to 
minimize it, 

There are two general methods of handling core allocation, 
and it is not clear which method is more desireable, One 
method is called page-turning, the other segment-turning. 
Under both methods, a set of pages will be grouped as a 
segment and given a name, Under segment-turning a whole 
segment is brought into core and kept there at least until 
the various processes are finished with it. Under page-turning, 
one page of a segment at a time is brought in, and a new 
page is brought in only when needed, Under page-turning 
unneeded pages are deleted singly, while segment-turning 
deletes the entire set of pages belonging to a segment if 
any one of them is deletable, Page-turning seeks to minimize 
wasted core space; segment-turning seeks to minimize overall 
processing time per user. Each method has its advantages 
and disadvantages. There is some evidence that neither is 
better (cf Scherr's Thesis, where it is shown that the 
scheduling and computational time quanta do not significantly 
affect system operation (17)}. This paper assumes a segment- 


turning system, 
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In conclusion: when a user's process refers to 
another segment that is not present in core, it will cause 
the Supervisor to generate a request to the drums. Ordinarily 
& request will be a read request, but it might also be a write 
request if the referenced.segment is one in core being declared 
in the reference as “dormant"; or it may be a delete request 
if the referenced segment is being deolared "dead", The 
queve will contain the waiting requests, while the drums will 
service them. A proper deletion policy is needed. Finally 
it is clear that the unit of information transmission ought 
to be the page, but the core memory allocation question, namely 
whether to allocate in pages or in segments, is open for 


discussion, 
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CHAPTER 3. THE DRUM SYSTEM. 
3,1. Introduction. 

The system model described here consists of three 
elements: the Users’ Processes, the Queue, and the Drums. 
The Users’ Processes element models requests to the Drums 
to read, write, or delete. The Processes will make requests 
at certain intervals given by some inter-request-time probability 
distribution; they will request some quantity of data in units 
of pages, beginning at a specified location on the drum, Several 
drums may be present, so each request will specify which 
drum is involved. Delete requests will be sent directly to 
the drums, while read and write requests will be entered in 
the Queue. The Queue will contain a list of which processes 
are requesting how much data from (or to) what drum, andthe 
starting location of the drum, It will act according to 
some queve discipline to decide which request is next to reach 
the drum, and will assign the request to a free channel to 
the requested drum. When a request is assigned to a channel it 
is deleted from the Queue, When a drum is notified by the 
Queue that 3 request is assigned to a channel it takes note 
of what program has been assigned to the channel, what the 
desired starting location and field are, and whether the 
request is a read or a write, A certain amount of time 
must elapse before the desired location has revolved into 


positions this time is the access time. Once the desired 
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starting position has come opposite the drum heads the data 
transfer begins, and ends after a certain amount of time, 
the transfer time, has elapsed, The sum of the access time 
and the transfer time is called the service time. The 
chanel idle time is the time during which the channel has 
no request assigned to it. There may be some question whether 
access time should be included in channel idle time. Since 
access time directly affects a given request's wait before 
the end of its service, we h-ve included it in the service 
time. Figure 3.1 shows the system in block diagram forn, 
as we have just outlined it. 

We now zive a complete description of each element 
starting with the most basic, and most probabilistic, the 


Users' Processes. 


3.2. The Users' Processes Model. 

In order for proper control of all computing facilities 
to be maintained, the individual processes in core do not 
make requests directly to the queue and drums. As discussed 
in Chapter 2, a request originates from the Supervisor, the 
program whtch controls allocation and proper operation of 
the system facilities, The Supervisor can prevent interaction 


between processes, providing protection against such 


USERS! 
PROCESSES 


drum requested 

starting position 

process making request 

no. pages in transmission 


6T 


Figure 3,1. The Overall System Model, 


happenings as some process erroneously requesting to write 
on top of another's information. The Supervisor will contain 
the queue. 

In order to vromote efficient operation, program seg~- 
mentation will be used (3,6). By breaking the program into 
segments, efficient use can be made of core memory, since 
those segments of a program in which no processes are presently 
taking place should be stored on the drum and should not be 
"cluttering up" core. When a process references a segment 
not in core, the Supervisor will request that the next 
segment or segments be brought into core. Clearly while the 
next segment or segments are being read into core, any waiting 
processes are suspended; hence our first assumption: 


Assumption 1. Once a process has caused a request for 
one or more segments to be read in, it is temporarily 
syspended until its new segments are brought in, In 
particular a process will be unable to cause further 
requests until at least the time when it is resumed, 


On the other hand, during the course of computation a 
process may generate some output data in core and request 
that this data segment be stored on the drum, for example 
so that it can reuse the same core space for further data, 
Such write requests do not imply that the process must come 
to a halt, hence our second assumvtion: 


Assumption 2. Upon generating a write request a 
process may continue, and in particular it may cause 
further read or write requests while a write 

request is being serviced. 


21 


From the above discussion, we may expect that a read 
request is more probable than a write request, and so our 
third assumption: 


Assumption 3. The probability of a process causing 
a@ read request is not the same as that of it causing 
a write request, and in general the probability of 
a read is greater than that of a write. 


In order to simplify space allocation on the drums, 
the surface of the drum will be divided into blocks, or 
pages, consisting of some fixed number of words, Thus the 
number of words per page is fixed, and 


Assumption 4, The unit of information transmission 
and storage will be the page, 


We have no reason to assume that the number of pages in an 
arbitrary segment is fixed; in fact all we can say is that 
long segments (those with many pages) will be unlikely as will 
extremely short segments (for example one or two pages). 

The number of segments in a block of n pages 1a @ random 
variable, and in particular the probability of finding 

exactly n pages in s segments may be given by a discrete 


Poisson Distribution: 
P(syn) = (AAHE g-m/W 20,142,404. is 


: B=1,2,3g006 
where the mean number of pages per segment is N. 
Consider this problem: if a process should reference 
more than one segment not in core, so as in initiate the 


read-in of several segments, should the Supervisor ask for 


a ee ae a pe en pe ene a eee mena Son RT ere enn creme 
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the several segments in a single request, or should it 

make separate requests, one for each segment? We are assuming 
that it is always possible to store the pages of a given 
segment sequentially on the drum, that is thst we can always 
read or write a segment without interrupting the transmission 
between start and finish. How this is done is considered in 
some detail in Section 3.4, For three reasons we argue that 

in the event of need of several segments contemporaneously 

' there should be a separate request made, one for each segment. 
First, since consecutive segments may not be all written 

at once, but may have been written at widely spaced intervals, 
and independently or each other, it is unreasonable to assume 
that segments will always be stored consecutively; although 
this could be done by the method of Section 3.4. Second, 
there is no assurance that the requested segments will all 

be on the same drum, or that the request will even be for 
consecutive segments. Finally some queue disciplines 
discriminate against long requests, servicing those requiring 
the shorted service times first (Section 3.3); asking for 
several segments in one request could well result in an 
inordinately long wait for service under such a queue discipline, 
We now make our fifth and sixth assumptions, 


Assumption 5. Each request will be for one segment, 
but at a request time a process may cause several 
requests. The probability that s segments will be 
requested will be exponential, that is 

P(s) =e ® 8=1,2,... (2) 
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Furthermore at request time there 1s no reason for 
all the requests to be either all read or all write; 
they may be mixed, A read request, or course, will 
cause suspension of the process. 


Assumption 6. The number of pages in the single 
segment of each request will have probability of 
being n pages 


P(n) = 52 e7n/N =0 51,2500 (3) 


where N is the mean, 
When a segment is active, that is, when processes are 
referencing it, the probability that the next requests occur 
at each successive time instant are independent so that we 
expect the arrival times or requests to be Poisson 
Distributed. A request is unlikely to be made immediately 
after resumption of a process from the last request, and it 
4s unlikely to be made an extremely long time after the 
resumption of a process, The probability of exactly k 


requests in a time interval t is 


k 
P(k,t) = —— ent t>0 (4) 
where a is the average number of arrivals per unit time. 
We have then 


Assumption 7. The inter-request times are taken from 
the following distribution* 


-at 


P(t) = ae t2>0 (5) 


*See page 60, 


24 


Something must be said about the starting position of 
the drum a particular request will seek. We have no information 
to allow us to assume anything other than that all drum 
positions are equally likely to be requested. 


Assumption 8. At a particular request time all drum 
positions are equally likely to be selected; that is, 
the density of angular positions requested will be 


P(0) = 0<0<a2n (6) 


Finally something must be said about which drum is to 
be requested, in the event that there are several drums in 
the system. When a process is making requests for several 
Segments there is no reason to assume that all the requested 
segments will be on the same drum, Hence we are willing to 
say that each of the D drums is equally likely to be requested: 


Assumption 9. Egch request is equally likely to 
be for any of the drums in the systen. 


Assumptions 3,5,7,8, and 9 are illustrated in Pigures 3.2 
to 3.7. 

Based on the discussion above, we are in a position to 
construct a model for the request activity of a given 
process, This model is shown in Figure 8,* 


*A Note on Notation: A fork is a point at which one process 
splits into two processes, which follow their own paths. A 
join ia just the opposite, where two processes become one; each 
time the join is entered the pperations in the box of the 
flow chart are carried out. An arrow doing this 
4s a termination of a process. A note in brackets gives the 
condition. permitting a process to emerge from the corres- 
ponding box, A function written with an argument (.) denotes 
a probability function for a set of identically distributed 
random wariableg. 
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rdwr(.) 


Figure 3.2. Relative frequencies of read and 
write requests, 


Figure 3.3. Relative probabilities of number of 
segments per request, 
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Figure 3,4. Relative probabilities of number of 
pages per segment. 


tra(.) 


Figure 3.5. Relative probability of inter- 
request times, 
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Figure 3,6. Relative probability of requested 
drum position, 
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Figure 3.7. Relative probability of requested drum, 
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nseg(.)—ens 


Send npg(.),drmpos(.), 
drm(.), rad to Queue. 


request 
finished 


Send drmpos(.), Ny 
wr to Queue. 


request 
finished 


Wait until. time = time + w 


*The delete mechanian 
4s disoussed in Seotion 


nseg(.) = no. of segments requested 
réwr(,) = rd-wr distribution 
npet -} = no, pages requested 
drm(.) = requested drum 
drmpos(.) = requested drum position 
neno, pages this request 
ns = temporary segment count 


Figure 3,8. The Processes Model. 
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The reader may be asking what justification there is 
for assuming the particular probability distributions that have 
been chosen; in partioular why we have chosen Poisson 
distributions as opposed to other distributions, It will be 
noted that these choices are completely arbitrary, and cannot 
be properly determined until some statistics are available 
about the system we are discussing, It is felt that the 


assumptions that have been made are reasonable, 


4.3- The Queue. 

The model of the queue is more straightforward and 
deterministic than the model of the processes, When a request 
is received from a process it is entered in a list within the 
Queue Element. Each entry in the list contains the following 
information: an identification number of the process requesting, 
the number of pages involved in the transmission, the desired 
starting location on the drum, the identification number of 
the desired drum, and whether the request is a read or a write. 
The number of pages is an important plece of information since 


it can be used to determine when the transmission is ended. 
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A possible structure for the Queue's list, which we 
will refer to simply as the queue, is shown in Figure 3.9. 
In this list two pointers are used, one to indicate the lower 


limit of the number in the queue (the shaded region), the other 

to indicate the upper limit. Both pointers are periodically 

incremented and are modulo capacity of queue. The lower pointer 

is moved down one position each time a new entry is made, and 

the upper pointer is moved down one position each time a 

request leaves the queue. If the next out is not the least 

recent entry, then all items above are moved down one position 

to fill the gap, The shaded area represents the number in 

the queue, frequently referred to as the length of the queue. 
There is a Boolean signal received from each of the 

drums indicating whether or not that drum is busy (all 

channels to it in use). Whenever all channels to a drum 

are busy, any requests arriving for that drum must wait in 

line, end a waiting line, or queue, is formed. If requests 

arrive too much faster than they can be serviced, the length 

of queue could become equal to its capacity and any further 

requests will be lost. Such a development is disastrous, 

since it would render a process useless, Hence the average 

arrival rate must not exceed the average service rate, where 

the rates are defined to be the reciprecals of the 


average interarrival and service times, respectively. 


least recent 
entry —>} 


LILO LD LLL number capacity 
next out —€—— in of 
LLIL IL LED queue queue 


latest entry LEA 


next entry —” Direction 


of push 


Figure 3,9. A structure for the queue stack. 
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When the Queue is aware that a drum is not busy, it 
looks down the list to determine which if any requests want 
the free drum, It then chooses one of them according to the 
queue discipline, assigns it to some free channel to that 
drum, the deletes the entry from the list. 

The queue discipline is simply the rule for selection 


f next out. We consider four queve disciplines applicable to 


our situation: 

(1) Pirst come, first served. 

(2) Shortest access time first. 

(3) Shortest Job first. 

(4) Mixed policy. 

(1) First come first served. 

This is the “fair® or “equitable” queue discipline, 
where requests are serviced in the order of their arrival, 
and is the case when the “next out" of Figure 3.9 is the 
“latest entry". It does not result in the most efficient 
operation. It is analogous to the normal situation encountered 
in a post office, when one wishing to buy a single stamp 
must wait behind a person with several packages, Certainly 
the waiting time is greatly increased because af i111 fortune, 
whereas the person ahead would not be significantly delayed 
to give way. Since a process is equally likely to ask for 
any drum position, and since the present drum position is 
likely to be anything, with the first come first served queue 


discipline the average access time is half the drum revolution 
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time. Lab us represent the time a request is in the service 
system (the time from when a process makes a request until 
the time service is completed) by Tye Let the drum revolution 
time be T, Let the average transfer time be T,. And let 
the average wait in the queue be Wye Then for first come first 
served, 

Trew + T, + t/2 (1) 


(2) Shortest Access Time First, 
Under this queue discipline the next out is selected 


according to following rule: 


Choose the one for which the rotational positioning 
delay until the desired starting address is minimum, 


Now if more than one request for a given drum is in the queue, 
on the average the access time will be less than half the 
drum revolution time; this is so since with more in the 
queve the probability that there is a request for the present 
drum position is greater than for a queue of length one, It 
will be shown later that the minimum access time is roughly 
inversely proportional fo the length of the queue. Hence 
for this queue discipline 

Tom W +7, + T/n (2) 
where n is the average number in the queue, and Wq is 
not the same numberically as for the first come first served 
queue with the same nN. Observe that the shortest access time 
queue is a dynzmic priority queue, one for which the priorities 


of requests are changing randomly. 
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(3) Shortest Job First. 
Under this queue discipline the following rule is 
used to select the next out: 


Select the request for which the servioe time is 
& minimum, The service time is the sum of the access 
time and the transfer time. 


A little thought should convince the reader that under this 
queue discipline the access time is not minimized, but yet 
it will in general be less than T/2 for queues of length two 
or more, Hence 


T+ W, + T+ 7 (3) 


where T/n <. f' < 7/2 , and Wg 4s not the same numerically 
as for either a first come first served or shortest access . 


queue having the same length n, 
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(4) Mixed Policy Queues. 
It will be noted that the shortest access time queue 


and the shortest job first queue are queues in which a continuous 
number of priorities exist. Suppose we become concerned 

about requests which might have to wait an inordinately 

long time, perhaps because of i11 fate, perhaps because its 

job time is long. This could be a real nvobies in the shortest 
job first case (what of the longest job of all?7). It ought 

not to be too much of a problem in the shortest acoess time case, 
since each time a request is to leave the queue, it has an 
equal chance among all the others of being chosen, This is 

only partially trne for the Shortest Job Pirst Case, where 

the job time has a random component, the access time; and if 

the transfer time component be very long, then the job time 
depends almost entirely on the transfer time. Notice that for 
very short transfer times, the shortest job queue will approach 
in operation the shortest access queue. One way of circum- 
venting the problem of some request waiting inordinately long 

is to introduce a skip limit into our model. Each time a request 
is skipped over as next out, a counter assogiated with that 
request is incremented, If this skip count ever exceeds the 
akip limit then this request is next. What we have done 

in essence is to add a first come first served component. to 

the queue. As the skip limit is lowered a- shortest access time 
or shortest job queue behaves more and more Like a first come 
first served queue, In fact a queue with the skip Limit set 


to zero is just a first come first: served queue. 


te ctr GR RE Stn can ee ore ae eae erg een Spree ger nee ERIN ema pt NN meena 


36 


We will include no more discussion on mixed policy 
queues since the problem is in general complex and unsolved. 
We will, however, mention the skip limit once again in Chapter 5 
under the discussion of simulation results, Finally, another 
mixed poliay queue is discussed in Appendix 3. For further 
discussions on the matter, the reader is referred to the 


literature (1,9,13,15,16). 


(5) A Comparison of Queue Disciplines. 
In Appendix 1 we have related the mean number in the 


service system, which includes those being serviced and those 
in the line, to the mean and variance of the service time 
distribution. We will denote the mumber in the system by L. 
The random variable of the service time, tes is the access 


time, tes plus the transfer time, t The service distribution 


t° 
‘ean be found from a sonvolution of the access distribution with 
the transfer time distribution, We will show later that 

both of these can be found, hence the service distribution 


‘can be fourd. In particular, the mean service time, T._, is 


8 
To2T& + (4) 
And the variance of the service time, 32 » is 
2 2 2 
gq 5 top tas (5) 
since we are assuming independence of a and tee Let us 
. denote the function relating L, Ty and of by 


L= F(T,,0% ) = F(t,7,, 024 0?) (6) 
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The function F from equation (6) 48 such that a decrease. 


2 
8 


4n L. The transfer time distribution will remain the same 


in either or both of T, and co” will result in a reduction 
for all queue disciplines since it is a function of the number 
of pages per segment, which is fixed before hand, and is 
assumed to be identical for all processes, 
Let the total number of processes in all be N. Then 
Newel (7) 
where w is the number of working processes, The efficiency 
of a svstem can be measured crudely by the number of processors 
working, and is 
efficiency = tt = Rat zl. b (8) 


To maximize the efficiency, L must be minimized. Thus the 
optimum queue discipline is the one for which is minimized, 
Notice further that the number of processors that can be 
kept working is just the number of working processrs? 

number of busy processors = w= Nfl - #) (9) 
Por the simplest system, the single=-processor system, w must 
never be less than one if the processor is to be continuously 
busy. 

On the average the transfer time is the same for all 
queue disciplines (because it relates directly to the number 
of pages in a segment). On the average the access time is 
explicitly: minimized nly by the shortest access time queue, 
Therefore the service time for the shortest access queue will, 


on the average, be a minimum, compared to other queues. 
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We see that the shortest access time queue minimizes the 
service time, while the queue which bears the “shortest job 
first" does not minimize the average service time, The 
apparent contradiction is resolved when we realize that the 
shortest access queue chooses the shortest job first on the 
average, while the “shortest job first" queue selects the 
shortest instantaneous job. Nevertheless equation (6) tells 
us that the shortest access queue must have minimal L associated 
with it, and is therefore the most efficient*. In fact, 

any queue which does not minimize the access time must be 
less efficient than a shortest access time queue, when 
efficiency is defined by equation (8). Chapter 4 1s devoted 


to a detailed study of this queue. 


Based on the above discussion, we give the Model of the 


Queue in Figure 3.10. 


*This conclusion is verified by simulation, See Chapter 5. 
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Wait for request 


Store pvrogno, drm, drmpos 
npg, rd or wr, in queue. 


YES 


all drums busy? 


Seleot next out according 
to queue discipline and 
skip limit, if any. 


Send drmpos, npg, progno, 
rd or wr, to drum no, i. 


npg = no, pages requested 
rd = read request 
wr = write request 
drm = the drum desired 
drmpos = the starting position 
s 


drmbsy(1) = indicates ith drum busy 
nq= number in queue 


Figure 3,10. The Queue Model. 
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3.4. The Drums. 

Since the method of distributing pages on the 
drum is of considerable importance, we will discuss it first. 
Consider Figure 3.11, The drum, we suppose, is divided into 
seotors as viewed from a cross-section, where the number of 
sectors is an integer. The number of words per page is just the 
number of words that can be written around the ciroumference 
of the drum divided by the number of sectors. The drum is 
divided into rings, and the width of one such ring is a fie]d. 
A field is one word in width, and a word is typically 36 binary 
bits, Each field is subdivided into a number of tracks, each 
of which is associated with one read-write head. The same — 
head is used for reading and for writing: a read amplifier 
or write amplifier is connected as needed. The operation a’ 
head is presently performing is called its status, and there 


is a delay associated with switching between read and write 


status. This selection delay is about the same time as for 
three or four words to pass beneath the head, so ordinarily 
the first few words on a page will be left blank to allow 
for this delay, 

It was stated previously that it is possible to write 
a segment of N pages on the drum contiguously. We indicate 
how this can be done. The question is: suppose some of the 
sectors in a field are used, how can a string of N pages be 


written consecutively, especially if a page would have 
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Shaded squares are a consecutive 
string of pages, each with a 
pointer to the next, 


Sectors 


p 


NG Ns Fields 


Wa ts Tracks per field 


Figure 3,11. Organization of the drum. 
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to be written on a used sector? The answer is that we 

do not attempt to write the pages in the same field. We 
require only that during a write operation there be at least 
one free field per sector. First of all, suppose that each 
sector was allowed to have all but C of its fields in use, 
where C is the number of channels to the drum, and where 

any channel can access any field. Suppose further that 
whenever fewer than C fields were free on a given sector a 
deletion occurred immediately. Then the drum could handle 

C simultaneous write requests because a free field can always 
be found. In reality a delete might not osour when necessary, 
and also there is the possibility that a segment is longer: 
than the number of sectors, which implies that more than one 
of its pages would be written on the same sector. It would 
be better to set a drum occupancy level, which is the ratio 
of allowed fields per sector to the actual number of existing 
fields per sector: 


occupancy level < Pet (1) 


where F is the number of fields per sector, C is the number 
of channels to the drum. Then whenever the occupancy level 
is exceeded, some sort of emergency condition would be set 
up, and any unnecessary segments would be removed from the 
drum (they would be deleted, or they might be moved to a 
lower level of storage, for example a disc file). In such 
a case the occupancy level would have to be less than the 
upper bound set by the equality sign in (1) to allow for 


statistical fluctuations, Simulation has shown that 
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ocoupancy levels in exoess of 93% are possible with a proper 
deletion policy, for typical parameters. See Chapter 5. 
When a write request comes, the pages are written 

on the drum on the first free field on each sector, and a pointer 
is left, to direot a read operation to the next field of a 
consecutive string of pages, These pointers are indicated by 
arrows in Pigure 3.11. Thus it 18 possible to have a string 
of consecutive pages written (and read) without interruption. 
We require rapid inter-field switching, a feature available 

on highs speed drums, It is to be noted that if the drum is 
be be operated thia way it will have to maintain its own 
“Field Usage Table* similar in principle to the "Track Usage 
Table* used in CTSS8 with the disc(2). When a write request 
arrives, this table is consulted to lacate the nearest 

free field on the given sector. 
. As long as the Supervisor's deleteion policy sees to 

it there are always sufficient free fields on each seotor, the 
drum operation is straightforward. The delete mechanism shown 
in Figure 3.12 determines how many pages are to be deleted from 
the drum; it does this whenever the desired occupancy level 

is exceeded, We may model the behavior of a deletionn by 
picking a random drum address and deleting one page from each 
sector until N pages are deleted. A “deletion" may be to 
_ remove the offending pages to a lower level of storage, or 


it may be to obliterate the pages entirely. 


Once a channel is assigned, the drum observes whether 
the request is a read or a write, and switches the heads associated 
with that field to that status, as soon as the starting sector 
is @pposite the heads. Note that the set of heads associated 
with a given field may be in use by different requests from 
seotor to sector, When the starting page is in position, the 
data transfer begins, allowing time for the switching delay 
at the top of each page. Three or four words left blank on 
a page is sufficient time for this. At the bottom of each page 
4s an End of Page mark, with a pointer to to the field containing 
the next page, which initiates switching to that field; of 
course the heads there are put into the proper status. 

Finally, at the end of the last page of thesegment, an'End 
of Pile mark will be encountered, and the channel is freed 


for the next request, 


The ideas for the drum model are embodied in Figure 3.13. 
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Figure 3,12. The delete mechanism. 


This empty page was substituted for a 
blank page in the original document. 
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Pigure 3.13. The Drums Model, 
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CHAPTER 4, THE SHORTEST ACCESS TIME QUEUE. 


4,1. Introduction. 


In Section 3.3 it was shown that the shortest access 
time queue discipline is the most efficient; it is the purpose 
of Sections 4,2 and 4.3 to analyze this queue as best can 
be done. The shortest access time queue is a special form of 
the shortest job first queue. Solutions have been obtained 
for shortest job first queues with the input rates independent 
of the queue leneth, No solutions have been obtained for a 
shortest jod queue in which the input rate is dependent on 
the queue length, that is, when there is only a finite number 
of requestors, In the next two sections we do not attempt 
to solve for the probability densities of queue length, 
waiting times, and service times; rather we talk only of the 
averages, which become time independent at equilibrium, when 
the input rate to the system is the same as the output rate 
from the system. In Section 4.2 we derive a probability 
density function for the minimum access time as a function 
of the mean number in the queve; then in Section 4.3 we combine 
these results with the results of Appendix 1 to obtain some 
approximate expressions for the number in the queue, and for 


the waiting time in the queue. 
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4.2. The Minimum Access Time Distribution. 


The access time is defined as the time from the exit 
of a request from the queue until the requested starting 
sector has come opposite the read-write heads, It is simply 
a@ positioning time. . We have shown in Section 3.3.5 that a 
queue discipline which minimizes the access time is the most 
efficient; we wish to derive the access time distribution 
in this section, and in a later section we will determine the 
waiting times in queue using the Pollaczek-Khintchine Formula 
(Appendix 1). We define the Sllowing quantities, given 
that n are in the queue: 


Ry = requested starting sector of the drum for the th. 


request in the queue. 
D(t) = The angular drum position at time t. 
A,(t) = required access time at time t for the 1°? 
qe ote) Siven that the present drum position 
is D(t). 


@ = random variable of minimum access time, which 
takes on values aoe 


T = drum revolution time. 

The model of the shortest access time queue discipline shown 
in figure 4.2.1 best illustrates what is going on. 

The comparators compare the reouested starting sector 
with the present drum position and give as an output the 
required access time. The Min(.) box selects the minimum of 
its inputs and sets its output to this value. To simplify 
the derivation we will assume that the Min(.) box normalizes 


its output with respect to the drum revolution time T, so 
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D(t) 
COMPARATORS 
A, (t) 
A, (t 
| Min(,) a 
Figure 4,2,1. Operation of shortest access queue. 


that a is a fraction between 0 and 1. We have 

a = # Min (A, (t),Ag(t),..,A,(t)] (1) 
where O< a<1., We are interested in the probability 
density of a as a function of n, the number in the queue. 

It was stated in Section 3.1 that the probability density 
of the R, 1s uniform, that is, all drum seotors are equally 
likely to be requested. Further we are assuming random 
segment lengths. If. segment lengthe and starting positions 
are random, the present drum position, which 41s the drum 
position just at the finish of the last request (so that 
the next request is about to be assigned), is random, and by 
symmetry and the independence of requests, we may assume that 


it is uniformly distributed.* 


*This is not true in the case of short segments because the 
drum will have rotated only a short diatance, This matter 
is discussed further in Section 4,2, page 63. 
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But if D(t) and R, for each 1 are uniform, then A, (t) must 


be uniform for each 1; that is, the 1° 


request's access time 
4s equally likely to be any fraction of a drum revolution. 
Figure 4.2.2 shows the density function for Ay(t), which has 


been normalized with respect to the drum revolution time, T,. 


P, (t/t) 
Ay 


a, 1 fr 


Pigure 4.2.2. Access time for th request, 

Now, the probability that a > a, is. just 
t 

Pla > a.) = PLA, (H)>a,,..A (Mra) tt = 
But the R, are independent, s0 that the A, (t/T) are also 
independent, and 

Pla > a.) = PlA(t*)>a,]...PLaj(t')>a,] tt =i 

But PLA (t")>a,] is just the shaded portion of Figure 4.2.2, 
and is simply (1 - a,). Then 


Pla > ao] =(l- a,)” (2) 
equivalently Pla < a, =1- (1 -a,)" 
and Pi(a,) = ase Pla < al 

P,(a,) = n(1 - a,)9-4 (3) 


Equation (3) 1s the probability density of a, given that n 
are in the queue, Figure 4,2,3 shows P,(a,) for a few values 


of n, 
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Figure 4,2,3. Shortest Access Time Distribution. 


By the definition of conditional probability: 
Pan (8p) = Pa (90/0) Py (n) 


where N is the random variable of the number in the queue, 


which takes on values n, Then 
n-1 
Pay (a, 0) = n(l i a,) Py (n) 


The mean access time, & , is 


- = 1 n=] 
ae aan ; a on(1 - a) Py (1) da, 
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Integration by parts over a, leads to 


= oe 
ae & 


i 
ney ml Py (n) (5) 
The second moment, a2, is 
z _ 2 J 92 n-1 
a“ = ah aon(l - a,) Py(n)da, 


Integration by parts over a, leads to 


a? 2) met ae Pal) (6) 


The variance of the access time distribution is then 


a eae 2 
Sgr eo 8 = Ete) ea) Pl) 


- Cary gtr Py(nd)® = (7) 


n=l n¢1 
Note that we have not specified Py(n)s the distribution of 
the number in queue. Note too that it is not the same as the 
time distribution of n. It is the distribution of number 
in queue as seen by the departing requests--we need Py (7) 
taken over instants when the next request is extracted from 
the queue, which does not happen at uniform intervals. It 
is a reasonable assumption*® that Py(n) 4s a normal distribution. 
This is only an approximation, since the normal distribution 
would allow for some probability of negative n, which is 
physically meaningless; this. must be used carefully when 


*Based on the Central Limit Theorem, 
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n *gs small enough so that the portion of the normal curve 


extending below n=0 1s appreciable, especially when the variance 


of Py(n), oe is large, so that 0, 2>N, Py(n) ts 
1 =\2, 2 
Py(n) = Tire, expl-#(n - n)“/o 7 J (8) 


Putting (8) into (5), 


ax £ mee 
n=] Jeno, 


Ten expl-#(n - n)*/o? J (9) 


And putting (8) into (7), 


2 2 2 i: 3) J/g 2 
i ae ee en 8) 


Equations (9) and (1) cannot be reduced further, even if 
the summations are taken to be integrations over the infinite. 
interval. These equations do, however, yield readily toa 

comouter, and families of curves for a and o2 
assembled and are show in Figure 4,2,4 and 4.2.5. The 


have been 


axes are normalized so that, given the drum revolution time 
T, values of access time oan be found, 

We wish to note the limiting forms of equations (9) 
and (10). These occur for n >> 1, and for a, << n, Figures 
4.2.4 and 4.2.5 show that for n > 8 we may ignore the effects 
of c,, for o, of interest (see Section 5,2), with only a 
small error. Now if on is very small compared to n then 


the normal curve approaches a unit impulse in the limit, 
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and the summations of equations (9) and (10) reduce to a 


single value, taken at n =n. Thus for o, << n: 


- 1 a 
a= i atn=n (11) 

1 _= 
9, = mI J n/(m¥2) atn=n (12) 


It is to be noted that (11) and (12) are evaluated at 
ns n, and that the approximation is very good if the conditions 
are mét; this is evidenced in Figures 4.2.4 and 4.2.5, where 
equations (11) and (12) have been draw, One of the prime 
assumptions of this derivation is that the drum positions 
at successive request-granting times are independent. If the 
drum positions at successive request-~granbing times are not 
independent, then the access distribution is in error, This 
is the case if the average length of requests is small compared 
to a drum revolution, See the discussion on page 63. 

In Appendix 2 one further result of interest is 
obtained. The form of the probability density for the waiting 
time in queue is derived and is shown to be exponential. This 
is in excellent agreement with the simulation results discussed 


in Section 5.2. 


Figure 4,2,4. 
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4.3. Examination of Shortest Access Time Queue. 


The solution to a queueing problem in which the pobicy 
is based on a continuous number of priorities, such as the 
shortest job first and shortest access time queues, is not 
easily obtainable. In particular no solution has yet been 
obtained for a finite requesting population, under a shortest- 
job-first type queue discipline, since the arrival rate of 
requests tends to depend heavily on the size of the queue 
and the service time. As the queue become full, the rate 
of arrival of requests tends to slacken because there are fewer 
members of the requesting population outside of the service . 
system, In this section we will derive a set of approximate 
equations for the number in the queve as a funetion of input ~ 
and service parameters, and indicate an iterative procedure «. 
for solving them, 

We suppose that the queve is in statistioal equilibrium, 
that is, the system has been in operation sufficiently long 
that the time average of number in the system is constant. 

We shall use the following notation: 

n= the mean number in the queue. 

Wy = mean wait on a request in the queue, 

T = drum revolution time, 


T, = mean service time, 
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T, = mean transfer time, 


Tt, = mean access time, 


@ = mean arrival rate, 
b = mean service rate. 
A = mean interval till the next request from one 


process, from the time it resumes, 

Ss = mean number of pages per segment. 

m= number of sectors around the drum, 

N = population size, i.e., the total number in 
the aueue, plus the number in service, plus the 
number generating requests, 


ry = traffic intensity ratio, i.e., the average 
number of busy channels. 


In the previous section we saw that die to independence 
of requests, random segment lengths, andrandom present drum 
position, that at each request-granting time, each request 
was equally likely to be next out. We have a series of 
Bernoulli trials, then, with a probability of : of a particular 
request being picked at a given trial, and probability 
(1 - 2) of being overlooked, where k is the number in the 
queue at the time of the trial. On the average we can say 
that the probability of being chosen on any trial is approximately 
i, where n is the average number in the queue, Therefore 


the probability of being chosen on the «oh 


requesting~grant ing 
time after a request enters the queue is, on the average, 


given by a geometric distribution, which we denote by P(k). 
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Then P(e) = (1 - 2)¥°1 (2) (2) 


We wish to determine the waiting time of a request in the 
queve, The z-transform of equation (1), which we denote 
by py (z), is 

Pitz) = Ba = Bt) ot 


which can be reduced to the closed form 


t 
Py(z) = Sete CEA (2) 
The mean number of trials before the given request is next 
out is 
T dt 
Keay My(2)Je 73 (3) 


And the variance is 
a2 ‘ 
yw LSS weed +X rk led - CE pe (2))*) 
oy = n(n - 1) (4) 


The average wait is just the average number of service 
intervals that must pass while a request is in the queue, 
If a request arrives just before a service begins it must 
wait only (n = 1) intervals; if it arrives just after a 
service begins, it must wait n intervals, as given by 
equation (3). On the average, then, it must wait (n - 4) 


service intervals, The watt in the queue is therefore 


Wy = (n - #) 7 (5) 
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We suppose that each interval is of duration T,. where T, 
is the mean service time, and 

T=T7 +t 
and where T, is the mean access time, For n in the queue, 


we can use equation (11) of Section 4.2, so that 


T +T, (6) 


o * mal 
It 18 reoalled that T, = T/(n¢1) is an approximation, becoming 
more accurate with increasing n, The mean transfer time is 
the time to service the mean number of pages per segment, 
which is 

T= 22 (7) 
where s = mean number of pages per segmerit, 


m = number of sectors around the drum, 


By putting (7) into (6) we obtain 


tT, = 1 stro & ) (8) 


8 n+1 
We have noted that the shortest access time queue is a 
random output queue, so that we can use the result of 


Appendix 1, which says that 
Mean number in the service system = mean number _ in queue 


mean service rate mean arrival rate 
(9) 
where the mean service rate is b = 1/(mean service time), 


and the mean arrival rate is a = 1/(mean arrival time).* 


S SannEEnE EEE 
We are assuming as in Section 3.2 that arrivals are Poisson, 
and that segment lengths are Poisson distributed. That is, 
the probability of exactly k requests in a time t is 

k .-at 
P(k,t) = (at) e t 20 
rt) = (ate 


: (continued) 
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Equation (9) 1s exact only when the arrival and service rates 
are independent of the number in the queue, which is not 

the case in the finite population system we are discussing. 

We can use equation (9) because there must exist an equivalent 
infinite-population system whose equilibrium arrival rate is 
the same as the arrival rate to the shortest access system 
when it 1s in equilibrium, We proceed to substitute the 
appropriate quantities into (9) and then solve for n, the 
mean number in the at equilibrium, 


First note that r, the traffic intensity ratio, is 


where a is the arrival rate at equilibrium. The probability 
of finding exactly k segments in a block of n pages is 


k 
P(kyn) = {/g) .-n/s n=1,2,3, +0 


where s is the mean number of pages per segment. The 
waiting time between poisson arrivals is 
P(t)dt = P(no arrivals during time interval t) 


X P(one arrival in time interval dt) 
k 
= fat) g-aty ig (ad (at) 


at 


so that P(t) = ae™ t>o, 
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also the time average number of busy channels: 


yr = & ~ mean arrival rate 
be = mean service rate 


where co = number of channels, 

b = mean service rate 

a = mean arrival rate, 
The mean number in the service system 1s just (ntr). Now 
if the interarrival time for one working process is A, then 


at equilibrium it must be, for all working processes, 


A 1 
(Wo-n-ry "a (10) 


Because (N=-n-r) are not in the service system, and are therefore 


making requests, We can now f111 in (9) to get: 


n+r n ° 
= (11) 
1/t. “(N= n= sy 
ss A 


We define a quantity R to be 


T ; 
Be ae er ore ree 
cleal cA cA ( ml - n ) (12) 


Note that R is an intensity ratio for one process, and 


T, =RAc (13) 


Then the intensity ratio r is 


T 
a s 
r=t> = —2R(N-n - 1r) 


Solving (14) for r, we find 


r= (N-n) pEy (15) 


SR a Rt A RR ET TN TARA Ne ER 
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After putting (15) into (11) and performing the appropriate 
algebraic manipulation, we find 


n = ———_1___,__ (16) 
1+ ie aay 

The form of equation(16) hes been chosen because it is 
solvable by a process known as relaxation (or 4teration), 

in which a guess at n is put into the right side of (16), keeping 
in mind that R = R(n). A new value of n is obtained, This 
new value of n is placed into the right side of (16) as 
before, yielding yet another value of n. This process is 
continued until the new value of n is the same as the previous 
value, It was found that (16) converges rapidly, within 

five cycles. 


Collecting the results, 


N 
i IMME TROIS 1S 1 
a (+B) (178) 
Wa = (n= #) RAc (17) 
T, = RAC (17¢) 


A simulation has been carried out to test equations (17). 
The value of n was found to be within 1% of the simulated values; 
the value of Wy was within 3%, These answers were considered 
satisfactory in view of the approximate nature of the 
derivation, 


Due to the nature of this problem we are unable to 
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say anything about the standard deviation of our results, 
Simulation has shown that the standard deviation of the number 
in queue is less than 1.0, while the standard deviation of 
the waiting time was in general somewhat larger than the 
mean, In particular, one simulation reported a maximum 
wait of about ten times the mean, 

As a final note we want to point out that one of the 
basic assumptions of this section and the previous section 
is that the drum position is random at each request-granting 
time, This meansthat the drum positions at successive request- 
granting times are independent, But this need not be the 
case. Suppose for instance that the transfer time, The is 
a small fraction of the drum revolution time,T (for example, 
suppose the average transfer time, T, » 0.17). Clearly, 
if this 1s the case, the drum positions at sucaessive request- 
granting times are dependent, because we can say that the 
probability of the drum being only 0.17 away is much greater 
than being, say, 0.5T away. This is obviously contradictory to 
the assumption of independent drum positions at successive request. 
granting times, Consequently we expect the access time to be 
below the predicted values, since the probability of finding a 
request wanting the present drum position is greater than 
if the drum position is random, If the access time were 
smaller than the predicted values, then both Wy and n would 
be smaller than predicted, T, would be smaller, and the 
system operationshould be more efficient, Simulation has 


shown that this is the case, that efficiency is increased 
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when segments are short, In particular, since (N=n-r) 
processes are working, then the fraction of processes that 


are working is 


(N_- 2 =r). N-2n (18) 


If n substantially decreases, by (18) the efficiency sub- 
stantially increases, The greater the efficiency, the 
greater the number of processors that can be kept busy, It 
is to be noted that when Tt, is of the same order of magnitude 
as T, or larger, then the drum positions at successive 
intervals become independent, and the analysis of this 


section is valid, 
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CHAPTER 5. IHE SIMULATION RESULTS. CONCLUSIONS. 
5.1. Introduction, 


In order to observe the operation of the model of 


the entire drum system, which is discussed in Chapter 3, 
it was decided to simulate the system. Project MAC computation 
facilities were used; the simulation was written in SIM, a 
new simulation language conceived and implemented by A.L. Scherr 
at Project MAC (17). SIM is an augmented version of the 
MAD programming language, adding several new statements to 
those already existing in MAD. It has the powerful advantage 
that the logical flow of the simulation is the same as the 
logical flow of the actual system, Each element of the system 
(see Figure 3.1), namely the processes, the queue, and the 
drums, is specified in the simulation as an Element (which 
is translated into a MAD external function by a SIM pre- 
compiler). The inter-elemental signals shown in Figure 3.1 
are implemented in SIM by system variables, which allow a 
signal to be transmitted from one element to another. A 
main program called SIMSYS coordinates the activity of the 
elements. 

Three simulations were run, One was a simulation of 
the entire drum system discussed in Chapter 3, Another was 
a@ simulation of the shortest access time queue discussed 
in Chapter 4, Section 5.2 discusses the drum simulation, and 
Section 5.3 discusses the queue simulation, A third simul- 


ation was used to develop Appendix 2, and is discussed there. 
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5.2. The Drum Similation. 
The three elements of the simulation were the Users’ 


Processes, the Queue, and the Drums. These elements and the 
signals that were passed among them are shown in Figure 3.1. 
The logical flow of each element is the same as shown in 
the flow graphs of Figures 3.8A, 3.8B, 3.10, and 3.12, where 
the models of the Processes, the Queue, and the Drums are 
depicted, 

CTSS has available a random number generator, which 
is useful in the simulation of the Processes to generate 
the probability distributions discussed in Section 3.2. 
The random number generator returns a number between Zero and 
one from a uniform distribution, This can be used to get 
numbers from other distributions in the following manner. 
First the cumulative distribution of the given distribution 
is found, which will have probabilities varying between zero 
and one. The random number generator can be used to select 
one of these values of probability. This value is substituted 
into the cumulative distribution which has been solved for 
the random variable, In the drum simulation numbers from 
exponential distributions were needed. Such exponentially 
distributed random variables can be obtained in the following 
manner, Suppose we want to select a random number from the 


exponential distribution of interarrival times, which has 


67 


been shown to be 
P(t) = aeW8t b>o0 (1) 
Denoting the cumulative distribution by Q(t), we have 
Q(t) =s ae@* at el-e (2) 
0 
Solving for t, 
t =~ dan (1 - at) (3) 


But in Q(t) all probabilities in the interval (0,1) occur 
uniformly, so we can use the random number generator to 
select a probability Q(t); substitution into (3) yields 
the desired exponentially distributed random variable, t. 
Equation (3) was used in the Process Model to select 
waiting times til the next request, and to select the 
number of pages in a segment. 
The following data were taken during ea typical 

simulation, for each queue discipline: 

(1) per cent process idle time; 

(2) waiting time in the queue; 

(3) number in the queue; 

(4) service times; 

(5) access times; 

(6) channel idle times; 

(7) number of fields used per sector on the drum. 


The following set of parameters was considered typical. 


SOT eee eR aN a EET 2 BEE IT SO UR TET te 
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Fractional drum occupancy........ceeces 90 
Number of processeS,.....seeccccesseee 20, 

Mean intererequest time....ccccccscces 15. msec, 
Mean number pages per segment......ec- 10. 
Read-write ratio... .csscseersecvscccccce 3. 
Number of drumS.,......cecccrecccsccecs 36 
Number of channels each drum.......ee6 3. 
Number of fields each drum........220. 2564 
Number of sectors on drum.....csccoees 64, 
Number of words per page..csscccesesee 64, 

Drum revolution time..cccccesceccsoeese 16.7 msec. 


The following per cents of process idle time were found for 
each queue @isciplines 


First come first served...ccecccscecees 55% 
Shortest job first..cccccccccccscccces Yt 
Shortest Access time first.........06. 41% 


Other simulations using modified sets of parameters 
(for example, two drums with two channels each; or longer 
service times, that is, more pages per segment) showed the 
same result--the shortest access time queue discipline results 
in minimum idle time. This point has been discussed under 
our comparison of queues in Section 3.3.5. 

Probability distributions of all data were taken. 
Three of them were of particular interest, and are reproduced 
here. These were the waiting time in queue, the number in 
queue, and the number of fields used per sector per drum. 
These are plotted in Figures 5.1, 5.2, and 5.3 for each 


queue discipline, The means and the maximum points are 


indicated, It is notable that the mean wait for First Come 
First Served was 17.1 msec, while for Shortest Access Time 

Pirst and Shortest Job First it was significantly less, 

6.8 msec for Shortest Job First end 6.3 msec for Shortest 
Access Time First. Again the Shortest Access Time Queue 

lead to the minimum wait, It is also of significance that the 
shape of the waiting time in queue distritution is exponential 
as predicted by Appendix 2. The number in the queue (Figure 5.2) 
was about 8 for First Come First Served, and half that for the 
other two queue disciplines, The number of fields per 

sector per drum (Figure 5.3), 1s not dependent on the queue, 

but is dependent only on the deletion volicy, which is shown 

in Figure 3.12. It 4s interesting to note that it is 

normally distributed, and that at desired occupancy level of 
90% the maximum data point was 242 out of 256 fields used (95%). 
The mean was 230 fields used (90%). ‘Tnis wag for a sample 

of 24,500 points. We conclude that ocoupancy levels in excess 
of 90% can be maintained without overflow. 

The remaining three distributions are not plotted here, 
but we will discuss each briefly. The service distribution 
was found to have approximately the same shape as the 
number of pages per segment distribution, but it was distorted 
due to the inclusion of the access. time in the service time. 
The mean service time was found to be the sum of the mean 
access time and the mean transfer time, as expected. 


The access time distribution was uniform for First 
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Come First Served, with a mean of 16.7/2 msec = 8.34% msec, as 
expected. For Shortest Access Time First this distribution 
was found to follow closely the predictions of Section 4.2. 
The access distribution for Shortest Job First was somewhere 
between the First Come First Served and Shortest Access Time 
distributions, as expected. 
Finally the channel idle time distribution showed 
that there was an insignificant amount of channel idle time, 
Let us mantion whet the maximum waits in the queue 
were, First Come First Served had the smallest maxinun 
wait, as expected, and Shortest Job First had the largest. 
Some numbers are, for the typical parameters listed on page 68, 


Pirst Come First Served........02 62. msec 
Shortest Acoess Time.......ceseee 65. mgec 
Shortest Job First.......eeeeee0-e 100, msec 


Note that the Shortest Access Time does not cause waits 
too much longer than the First Come First Served Queue. 
Other simulations were run, in which the Shortest Job 
First queue was observed to have a maximum wait of 4 sec, 
for parameters not too different from the ones listed on 
page 68, 

A last point: queues in which the skip limit* 
was used hove a “First Come First Served" component, and 
are accordingly less efficient than a Shortest Access Time 
queue, A skip limit of ten in a Shortest Access Time queue 


caused its effieiency to be only slightly greater than 


*Section 3.3.4. 


300 - 


First come first served: mean=17.1 msec, max=62.4 msec. 
Shortest Access time: meane6.32 msec, maxe63.9 msec. 


Shortest job first: mean=6,84 mseo, max=100.6 msec. 
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Figure 5,1. Probability densities of waiting time in queue 


for various queue disciplines. 
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Figure 2. Probability densities of number in queue 
for various queue disciplines, 
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Figure 5.4, Probability density of number of 
fields used per sector. 
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a First Come First Served queue. 


5.3. Shortest Access Time Queue Simulation. 


This simulation was composed of two elements, one 
to make requests, and the queue, With Section 4.3, the 


arrival rate of requests at equilibrium is 


a= titan (4) 
where 

re S(t +!) (5) 
and = drum revolution time, 


inter-request times per working process, 
number of channels, 

= mean number of pages per segment, 

= number of sectors per drun, 

= mean number in the queue, 


T 
A 
c 
8 
m 
n 


The simulation was seeking to test equations (17) for 
Section 4.39 which are 


N 

a ae oe (6) 
es 2) 
1+ Re Te R 

W = (n-#) RAc (7) 


Four single-channel (c = 1) simulations are considered here. 
The parameters were? 


Parameter 


set number —_ 1 a 3 4 

f 16.7 msec 16,7 msec 16.7 msec 16.7 msec 
A 15.0 msec 5.0 msec 0.5 msec 1.0 msec 
8 4.0 8.0 40.0 2.0 

m 64.0 64.0 64.0 32.0 

N 20,0 10,0 25.0 5.0 


75 


The results, were, for simulation samples of about 3000 data 


points, as follows, 


Parameter a Wq 

set number predicted simulated predicted sjmilated 
1 12.85 12.97 27.78 msec 26.91 msec 
2 7.96 7.99 29.48 " 29,01 "“ 
3 24.00 23.68 260.98 “ 259,12 " 
4 3.98 4,00 15.30 " 16.54 * 


It is apparent that the agreement is good, 
One last point: in Section 4.3 it was mentioned that 
1f the drum position is not random, that is, when short 
segments were used, then the access times should decrease, 
and in particular the number in the queue and the waiting 
times should decrease, The following simulation verified this: 
Parameters: 


T « 16.7 msec 


Az 7,0 msec 
s = 3.3 
m= 64,0 
Resu : Random Function of time 
n W n W 
qa q 
3.86 17.11 msec 2,48 11.40 msec 


There is a significant difference, and fortunately the 
errors are in favor of much increased operational efficiency. 


From Section &,3 the efficiency is 


N-n 
N(l + B) 
Efficienoy: Random Funetion of time 


] 32% ws 46% 
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5.4. Conclusions. 
In this paper we have shown that for a segmented 


multiprogrammed, multiprocessor computing system, the 


following is trues proper maintenance of auxiliary memory 


can greatly improve system efficiancy, We have shown how 


this can be done. In particular: 


(1) 


(2) 


(4) 


It 1s generally possible to store pages consecu- 
tively on the drum, and proper deletion policy 
can be used to maintain occupancy levels in 
excess of 90%, 

The Shortest Access Time queue discipline is 

the most efficient queue for an auxiliary 

memory, where time is spent waiting for mechanical 
parts to move into some proper position. 

If request sizes are large, that is if segments 
contain many pages, then it is not difficult 

to derive equatiors for the average number in 
queue, and for the average wait in the queue. 

If the segments are short, these equations break 
down, but provide an upper limit for the average 
number in queue and the avevage wait in queue: 

The error is in favor of inoreased efficiency, 

A reasonable probabilistic model for the processes 
in a segmented computing system has been given 

in this paper, 
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(5) Simulation is a particularly useful tool for 
analyzing problems of the complexity of computing. 
systems, for it is frequently helpful in providing 
a starting point for analysis. 

(6) Mixed-Policy queues may be used in drum (or 
disc) auxiliary memory systems when we become 
concerned that some requests might hive to wait 
inordinately long. A “skip limit" queue was found 
to be more efficient than a “window" queue (see 


Appendix 2). 


5.5. Suggestions for future study. 
(1) The deletion policy of the Processes model. 


Although it is possible to prevent drum overflow, 
and to maintain 90% occupancy, exactly what deletion 
policy is the best, if any? See chapter 2 and 
Section 3.4 for discussion. 

(2) The “page-turning" vs, “segment-turning"™ allocation 
problem of Chapter 2 should be considered in 
detail, 

(3) The finite population, shortest job type of 


queue is yet to be completely analyzed. 


APPENDICES 
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APPENDIX 1, THE POLLACZEK-KHINTCHINE FORMULA. 

In this apvendiz we will derive an equation which 
Saaty (16) refers to as the Pollaczeké#Khintchine Formula 
(Saaty, pp 40-43). Saaty has derived it for the poisson 
input, single channel, equilibrium: queue. We will extend the 
reasoning to imelude the c-channel server. Since we talk 
only of the number in the system, the queve discipline is 
irrelevant to our discussion, until we begin to talk of 
waiting times. 

Suppose that arrivals occur at random according to 
@ poisson process at a rate a per unit time, to a waiting 
line in statistical equilibrium, before a o-channel facility. 
They are served according to some arbitrary service-time 
distribution at a rat b per mit time per channel, We assume 
that if the service rate of one channel is b per unit time, 
then it is be per unit time for all c channels operating 
together. Suppose that a departing request leaves q in the 
system behind, including those in service, and that some time 
t will elapse before the next departure. Let the waiting 
line increase in length by k requests during this one 
service interval, If the next departing request leaves q' 


behind in the system, we can relate q and q* as follows: 


q’ =e mx(q-1,0)+k2=q-ele+da+k (1) 


where d(a) =O if q >0 
d(q) =1 if q=0 


By introducing d(q) we eliminate the max expression. 
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We assume that equilibrium values for the first and 
second moments E(q) and E(q°) of the number in the system 
exist, where we are treating q as a random variable. We note 
that E(q) = E(q*) and E(q”7) = E(q**) since both q and q! are 
assumed to have the same equilibrium distribution. We observe 
that since equilibrium, eaeh departing request must leave 
behind identical time-independent queues, each having the 
same probability distribution. Now, from the definition, 

a? = d, and q(1 - 4d) =q. Thus, taking the expected value of 


(1) we have 


E(q*) = E(q) = E(L) + E(d) + E(k) (2) 
but since E(q) = E(q') we have 
E(d) = 1 - E(k) (3) 


During an inter-departure interval of length t we have 


co k 
E(k) = £ xlat) e78t = at (4) 
k=o 


E(k?) = r x? fat)" ees (at)? + at 
=~ " (5) 
Let us denote the combined service distribution for all 
ce channels operating in parallel by S(t). Taking the 
expectation of E(k) with respect to this service time 
distribution we see that 


E(k) = (at) S(t) at 


= af & S(t) at (6) 


E(k) = =r 
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since the mean of S(t) is 1/bc,. But since se have not 
specified S(t), E(k) = r is unaffected by the type of service 
@iatribution. Then 

E(d) =l-r (7) 
Now, if the probability of the queue increasing by k is 
independent of the length of queue, q, and of d, which 
depends only in q, any expectation over products of r, q, 
and d is just the product of the respective expected values. 
Therefore 


E(k?) = Fae)? # at) S(t) at 


which is an average over all time. But 


E(k?) = (ae)? S(t) at + f (at) S(t) at 
0 


272 


zaX t© + at. 


But the variance of S(t), 32 » is 
2.42. <2 

Se. #49 X 
Therefore 

E(k?) = am(o2 + ¢*) + at. 
Finally, 

2 2.2, 2 rs 
E(k°) =a°oS +r°4+r r= (8) 


If we square both sides of equation (1): 
q’? =(q-1)7 +4 2(q - 1)(a +k) # (a + x)” 


qt? « a? - 2q(1 = k) + (k - 1)" + a(2e - 1) (9) 


82 


Equation (9) was obtained by using qd = 0, and a™ = d. 
Because of equilibriun, 


E(q*) - E(q'*) = 0 = 2E(q)E(k = 1) + E((k - 1)”) 
+E(a)E(2k - 1) (10) 


Recall that the validity of (10) depends on the independence 
of q and k, Solving for E(q) and using equations (6), (7), 


and (8), we have the Pollaczek-Khintchine Formulas 


2 
= + E « 
e(q) = Eik=2)) + Bld E(2k = 2 
a*o? +re4¢r-2re¢1le (l = r)r = (1 - r) 
2-< 2r 
r? + aa? 2 
E(q) =r + 377) ft he (11) 


Thus, once we know the variance of the service time distribution, 
the average number in the system, E(q) is determined, It 

4s 1tmportant to note that E(q) is an average taken over 

instants just following departures, and is not the time 

averace, If E, (a) is the time average, all we can say 

without further argument is that 


E(q) < E, (a) < E(q) +1 


In general the average number in the service system equals 


the sum of the average number of busy channels (here it is 


re to ) plus the average number in line. 


To obtain the average wait in the waiting line, which 


we will denote by E(w), we observe that a(E(w) + + ) is the 
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expected number of arrivals during the expected time of one 
request in the service system, if the queue discipline is 
first come first served, because * is the mean service time, 
But this must be just the number in the system immediately 


after a customer departs, that is, E(q), so 


a r* + a7o2 
aE(w) + oo 5° + SSR 
but r = &, so 
bo" 2? + aa? 
Wa = E(w) = tall - F) (12) 


We have pointed out that r is just the number of busy channels 
and that E(q) is the expected number in the system, Inspec- 
tion of (11) will show that the number in line, Ly must be 


r* + a*o 2 
Ly = 201 - r) 
We have the interesting and important result 
pact 
ae (13) 


Notice that this is exact only if the number in the system, 
E(q), 1s independent of the service time or the arrival 

rate, as pointed out after equation (10). We also note that 
if (Wy + ) is the time of one customer in the service 
system, then be(W, + = ) 4s one more than the number in 

the system, E(q). This is so because if E(q) are in the 
system, then E(q) - 1 service intervsls pass while one request 


is in the system. Therefore bow, = E(q) and we have the 
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second result 


w = Bia) (14) 


In words: 


w= averace number in the line 
q average arrival rate 


_ average number in the system 
“ awerage servioe rate 


These are true for arbitrary service distributions, 
It is interesting to note that if the service times 
are exponential, that is, the service follows a poisson 


law, then the interval between departures is given by 


S(t) = poe" et 


t > 0 
It is a well-know fact that for this type of distribution 


the variance equals the mean squared, that is, 


of =f tre Pet at - [s*tve eet at]? 
0 


" 


B® 

= (1/ve)? 

Substitution of this into (11) yields 
2 

L, Pe 2 sa (15) 

From which it follows that 
2 
Ly ii Bae (16) 

and E(q) = i z = (17) 


Consider for a moment the geometric distribution 


P(k) = r“(1 = r) (18) 
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It is known that this distribution describes the number 
in a service system with exponential input and output 


(Saaty, 17, pp 38ff). The expected number in the system is 


e k 
L= &£ kr (l-r) 
k=0 
a: Jk 
=(l-r)r= tr 
ar k=1 
_ —2 
“jer 


which is the same as (17). Then we can find the variance 


of (18) which is 


eo ao 
rk rk (l-r) - i’ s (l-r)r Sry 5 r* ie ig 


k=0 dr ar k=0 
2 
of = —Z— +—_, 
(1 - r) (1 - r) 
os L(L + 1) (19) 


We have the result that for the exponential input, exponential 
output system the number in the system ts given by (17), 

the number in queue by (16), and the variance of the number 
in the system by (19). The results of this section will hold 
for queues in which the discipline is random as well as for 
first come first served, They hold for random disciplines 
because, on the average, the number of service intervals 

that must pass before service is the same as for first 

come first served, This 1s seen in Section 4.3, In fact 

the equation for the mean number in the queue, L, 4s accurate 


if the following conditions are satisfied: 
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(1) all requests stay in the queue until served; 


(2) the service time distribution for all channels 
is the same, with parameter b; 


(3) channels serve one at a time; 


(4) a channel serves the next request, if any are 
are waiting in the queue, as soon as it finishes 
with the last request, 


A little thought will show that if these four rules hold, 
the length of the aueue is the same for all disciplines, 
although the mean wait, Wy? will vary. (Morse, 13, p. 117). 
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In this appendix the probability density funotion for 
the waiting time in a shortest acess time queue is derived, 
We define the following random variables: 

A= r.v. of access time, taking on values a, 


N = r.v. of number of requests to exit the queue 
before a given request exits, taking on values n, 


P = r.v. of number of pages per segment, taking on 
values p. 


R = r.v. of number of requests in the queue, taking 
on values r, 


T= r.v. of transfer time, taking on values t. 
Wer.v. of waiting time in queue, taking on values w. 
Since at each trial (request-granting time) all requests 
are assumed to be equally likely to exit next (Section 4,2) 
the distribution of N is geometric. As on page 58, 
equation (1), the conditional distribution of N given that 


BR are in the queue is 
1yn-1(2 

Pyyg(t/r) = (1 = 2)9*(2) (1) 
where R is the random variable of the number of requests 
in the queue. Denoting the density function of R as Pa(r)s 
We are interested in the wait in queue, so we have defined 
the random yariable of wait to be W. Then 

Pump (¥>sr) = Py /ug(¥/rer) Py s_(n/r) P,(r) 


(3) 


For a single channel queue the wait in the queue is N access 
times plus N transfer times. As in Section 3.2 we assume the 


number of pages per segment to be a random variable, P, where 
Pp(p) = op e7°P (4) 


and c is a constant proportional to the mean number of pages 

per segment. If T* is the drum revolution time and § 

the number of sectors around the drum, then the transfer time 
for one pages is T*/S8, Denoting the random variable of transfer 


time by T, we have for the density function of Ts 
Pr(t) = Kt eK (5a) 


with the constant k defined as 


k= 2 (5b) 


and N is the average number of pages per segment. The wait 


in the queue is, from above 

W=N (A4+T) =NA+NT wey +z 
with y = NA and z = NT, 

From Section 4,2, the cumulative distribution of the 
access time is 
: i ss N 

PCA <a.) = 1 - (1 = a) 
But y = NA. Then 

My <a) = Pas fi) =1- 1-6)" 


ae Cire (6) 
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Now let u = -a/N. Then 
Ply <a} e1{(1+ ui” 9 (7) 


For large N, u approaches zero and we know 


ere (1+u™ =e (8) 


Thus for large N$ 
Ply <a] ws 1 -e™* 


and the density function for the access time component 


of the wait in queue is 
a -a 
Py (a) lat 7% PLy < al se (9) 


Using an elementary probability transformation, the density 


function for z = NT is 
P_(v) = #P,( 2) « rela en (kb) N 
Zz =n ity’ * 2 
Defining K, = k/N = 8/T'NW, we have 


ee (10) 
Pi (>) Ki be 


Since A and T are independent random variables, the conditional 
density function for W, given N and R is 


Lo] 


Pysigi™) = f Py(wex) PL (x) dx 


the convolution of Ra) and Pi(b). This evaluates to be 
2 
_. 2 o-W 


(K, - x )* 


*this approximation is surprisingly good for N > 10. 


90 


Recalling equation (3), 


Putting (1) and (12) into (3), 
2 


Paya ener) = C eT em WL (a-2)"-7 (2) ICP, (r)] 
= 


2 
k = 1,n-1 
Fagen ome) = TE On ag GY Bt 


If N is large, as it is assumed to be, then P,(r) is 
approximately Normal by the Central Limit Theorem, and 


2 m\e2 2 
Puy (Weer) —(k/n)” (1-2)9-1 (2) lL yo ( rR)" /20, 


((k/n)-1)7 v Lig, 


(13) 


which is the required joint density function of waiting time 


in the queve. The simulation has shown that for the mean queue 


length, R, greater than about 10 with o <8 (which is the case 


when R > 10) this approximation 1s valid. Thus in the 
steady state situation, it is clear that the probability 


density for waiting time in the queue is approximately 


exponential, a fact verified by simulation (Section 5.2). 
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APPENDIX 3. DESCRIPTION OF A MIXED-POLICY QUEUE. 

The queue described in this section has been proposed 
as a shortest access time queue, but one for which we are 
concerned that a particular request may be continually over-= 
looked due to the random nature of selection, Consider for 
example a queue which has many requests in it (at least thirty). 
Such a queue might occur if it were decided to request 
pages singly instead of in segments, In Section 4.3 it 
is shown that the waiting time of a request until it leaves 
a random output queue is given by a geometric distribution, 
with the expected wait equal ton service intervals, where 
n is the average number in the queue, Now ifn is large, 
then it is conceivable that a request might have to wait for 
a very long time: the variance of the geometric distribution 
is (n)(N - 1) 8s n* ifn is large. 

Consider the queue shown in Figure A3.1. A new request 
is always added to the bottom of the stack, A section of 
the stack, of length N, is considered, the remaining requests 
in the queue being ignored for the while. We shall refer to 
the portion of the queue under consideration as being viewed 
through a window, of size N. The top of the window is 
always at the top of the stack, The requests in the window 
are labelled Ry Rose ee Ry » and are considered according to 
the shortest access time first queue discipline, Whenever 
the request marked Ry is removed, the window is moved down 


until its top coincides with the next request Ry. It 
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2-—- 
Ry ——_e 
R,—— 
: N WINDOW 
ky 
M 


Next in--? 


Figure A3Z,2. Structure of a Mixed-Policy Queue. 


93 


7 service 


appears that R, might have to wait until the N° 
time before it leaves, but no longer (by then it would be 
the only request in the window); thus it seems that an upper 
bound can be placed on the waiting time in the window, 
namely (N - 1) service intervals, But this is not so, 
Consider the request marked Bye Suppose by some quirk of 
fate that requests are serviced as follows, Ry Royeees 
Rye Byyy oes eBoy Rye This would happen if Ry sees s Ry iy 
were serviced before By but then the window has become 
positioned at By» and the next (N - 1) requests could be 
serviced before Rye It is clear that the maximum wait in the 
window is 2(N = 1). Since the arrival rate is given by 
an aveFPage, the expected wait before reaching the window is 
Ms an upper bound to the wait is M + 2(N «- 1) service intervals. 
We are assuming M > N, 

To find a lower bound on the waiting time, consider 
the following argument. Suppose a request enters the queue 
just before the window makes a jump of N, then suppose the 
window moves one position at the end of each service interval, 
The request in question would then wait only (M - N) service 
intervals to reach the window. Then suppose it were let 
out immediately, The minimum wait is therefore M - (Nel) 
= (M -N +1). We have set an upper limit on the waiting time: 


Wiay = (M+ 2(N = 1)) t. (1) 
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and the lower limit of waiting time is 


Wnt 


Equations (1) and (2) assume that M > N, 


n= (Me (N-1)) E (2) 


On the average the window is not full. We can think 
of the vroblem as a flow problem, with requests flowing into 
the bottom of the wondow at the rate of one per service 
interval, and filtering out through the window at the same 
rate, Let us imagine one of the requests being tagged so 
that we can keep track of it. If we know on the average how 
far down the window a request moves before it exits then 
we know the mean wait in the window, Simulation of the 
problem for several window gizes was carried out, and it 
was found that on the average the tagged request went 
half way down the window before exiting. Then we can write 


N. - 
Way = (M5) T, (3) 


Figure A3.2 shows the probability densities of a request 

being at various positions in the window. It is to be noticed 
that the tagged request spends considerably more time at the 
upper and lower ends of the window then at the center. 

The standard deviation was found to be 0.8 of the mean, so 

the contention that the request isa ¥ on the average is 

not too certain, This implies that the probabilities of 

Wage and Wain are not small. Figure A3.3 shows the probability 


density of window jumps. The average window jump is about 


5. Figure A3.4 shows the following: the mean position reached 
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by the tagged request, and the mean window movement when it 
moves, both as a function of window size, 

Recall that forefficient access time queueing the 
mean in the queue had to be at least eight. Hence we would 
require that the window length be N> 16, But since M>N, 
the overall queue would have to have an expected length M + N > 32. 

It appears that the use of the minimum access time 
queue without the window, but with the “skip limit" mentioned 
in Section 3.3.5 is better for the following reason. The 
skip limit could be set to an upper limit of 2(N - 1) so that 
the maximum wait for that queue would be the same as given 
by equation (1), but with M= 0. Since the “skip limit” 
queue with the same maximum is longer than the corresponding 
“window" queue, the access time is shorter, and more efficient 


queueing is had, 
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Figure A3.2,. Probability density of position of tagged request, 
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Figure 43.3. Probability density of number entering window 
when it moves, 
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Figure A3.4. A comparison of window movement and average 
time spent in the window. 
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APPENDIX 4, A CONTINUOUS-TIME MARKOV MODEL. 
With Howard (11, pp. 92ff) we define a rate Matrix [A], 
having elements By4° The rate matrix is similar to the 


familiar Markov transition probability matrix except that the 


th 


elements ayy represent transition rates from the 1 to the 


yeh state, The rates are assumed to be taken from exponential 


distributions. A transition matrix, then, is a discrete form 
of a rate matrix, Since we consider an equilibrium system 
the overall rate of change must be zero, Define a state 
probability vector P, where P = LP) sPose++ oPyl and p, is the 
probability that the system has 1 requests in it. Because 
of equilibrium, 

CP]la] = 0 (1) 
We make the following assumptions. 


(1) All requests join the queue and do not leave 
until service is complete. 


(2) Each channel serves one request at a time, and 
does not begin the next request until the present 
request is finished, 


(3) As soon as a channel becomes idle, the next request 
enters service, provided there are some in the queue. 


(4) The queue discipline is first come first served, 
or else random, For any other queve discipline 
that satisfies (1) through (3) the expressions 
for state probabilities and average number in 
line are the same, but the waiting time in the 
queue is not the same. See closing remarks of 
Appendix 1, 


We use the following notation: 


M = the size of the finite number number in the 
total population being considered--it is the 
sum of the number in the service system plus the 
number making recuests, 


pore errr ce 


EN PE EI RETR Sp 
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a@ = mean request rate per requestor, where 1/a = 
mean interarrival interval per requestor. 


b = mean service rate per channel, where 1/b = to 
the mean service time, 


¢ = the number of parallel channels providing service. 


P= the probability of the service system having n 
of the M possible requestors in it. 


If the system in in state n (indicating that n requests are 
in the service system, and that (M - n) are remaining outside 
in the requesting population) then the rate of exit to the 
state (n+l) is nb for n < a and its cb for mo, We have the 


rate matrix 


-Ma Ma 0 0 eee eee 
ie} -be-(M-1l)a (M-1L}a 0 eee eee 
0 2b @-2b-(M-2)a (Me2)Q 600 eee 
(a] = eee eee eco eee eco eee 
coe ees cb -cbe2a 2a 0 
eee one 0 cb -cb-Aa a 
eee one 0 0 eb -cb 
th 
At the c row the matrix is 
eee (CHl)b =(c-1)b-(M-ctl)a (M-ctl)a 0 oe 
eee O cb -cb-(M-c)a (M-c)a ee 


eee O 0 cb -cb-(Mec-l)a .. 
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Because of equation (1) we can write 
~Map, + bp, = 0 


Map. - bp, - (M-l)ap, + 2bp, = 0 


° 
and in general 


(M-nt+l)ap, » - nbp, - (Men)ap, + (n+1)bp =0O £1é<nx<c 


n+1 
(M-n+1)ap,_, - cbp, - (Men)ap, + ebp,,, = 0 o<n<M 


th 


Adding the n~ and the (n-2) eruations, which is equivalent 


to adding adjacent columns in [A], we have by recursion 


P, = Mrp 
Me1 M(M- 2 
i ik Bae = rT Po 
* _ M(M-2) (M-2 Men) _n 
Py = n! r Po 
so that 

M! 
Ph n!(Men)! rp, Oen<o 

M! 1 = 
Ph c!(Men)! i n=-c Po o<_nsl 


where r = - P. is found from the requirement that 


° 
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If h is the average number of processes actually in operation, 


k the number being serviced, and lL the number in line, then 


and 


The 


The 


and 


The 


The 


The 


k¢ehe Ly = M 


because of equilibrium 
= 2 


h = 
cb 


number being serviced is 


cel M cel ( ) 
k= £ nmp+cokp =caoe- £ cen} p 
n=0 7 neo 7 n=0 m 


number in line is 


as usual the waiting time in the line is 


L 
Wo = 
q a 


number in the system is 


M 
L=L+ke# = np 
a n=0 B 
efficiency is 
number of working processes _ Lateale Seated 
M x 
NM 
M- Inp 
a n=0 = 
M 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


summations can be evaluated on a computer without too 


much difficulty if the factorials are expressed as logarithms, 


and 


use is made of the fact that 


n 
n! = exp ({ £ in (4)] 
{#1 


SP eke OR INT MI ee he ER RM Stoo ee 
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It is interesting that a closed form for (10) can be 
obtained when there is one channel, 1.e€., when c=l. In this 


case equations (2) become 


M! n, 


Py = (Hen)? r La n=0,1,...5M (11) 
and (4) becomes 
Py * i. r=@ (12) 
M 
>» te rm 
n=0 . 


Then L is the number in the system, and Lerel, is the number 


in the line, 


M 
L=p nM’ n 
o = (ient r (13) 
nso {N-n): 


Consider 
M M 
MeL Men} M$ on Mi nm 
——s= ££ rs ££ r (14) 
Py n=0 Men)! Men=-1)¢ 
expanding (14) we find 


i = M+ M(M-1)r + M(M-1)(M-2)r° + 2... (15) 
ce] 


But 
Pon 2 3 
pl 7 ol + Mr + M(M-1)r° + M(M-1)(M-2}r° + ... (16) 
° 


Comparison of (15) and (16) reveals that 


MeL. (2 . BS 

ilies 3 1) I (17) 
Solving for L, l-p 

L=M- 2 (18) 
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All that is needed to find L is an evaluation of Pos not 


an evaluation of each P, 48 well, The number in the queve is 


le-p 
é oer? 
L=bere-3r-—=5 (19) 
So that the waiting time in queue is 
L M 1- 
Wee Dee Be Po 
qa a a’ a ar 
l-p 
=i ao) oe 
w= S(-—> ) t. (20) 


where ts = 2, the mean service time. 

The interested reader is referred to A.L. Scherr's 
Doctoral Thesis, in which it is shown that Multiprocessor 
time-shared computing systems are in general, accurately 
described by Markov Models. 
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